Instance-Family Abstraction in Memory-Based Language Learning
نویسنده
چکیده
Memory-based learning appears relatively successful when the learning data is highly disjunct, i.e., when classes are scattered over many small families of instances in instance space, as in many language learning tasks. Abstraction over borders of disjuncts tends to harm generalization performance. However , careful abstraction in memory-based learning may be harmless when it preserves the disjunctivity of the learning data. We investigate the eeect of careful abstraction in a series of language-learning task studies , and a small benchmark-task study. We nd that when combined with feature weight-ing or value-distance metrics, careful abstraction , as implemented in the new fambl algorithm , can equal the generalization accuracies of pure memory-based learning, while attaining fair levels of memory compression.
منابع مشابه
Careful abstraction from instance families in memory-based language learning
Empirical studies in inductive language learning point at pure memory-based learning as a successful approach to many language learning tasks, often performing better than lerning methods that abstract from the learning material. The possibility is left open, however, that limited, careful abstraction in memory-based learning may be harmless to generalisation, as long as the disjunctivity of la...
متن کاملExceptionality and Natural Language Learning
Previous work has argued that memory-based learning is better than abstraction-based learning for a set of language learning tasks. In this paper, we first attempt to generalize these results to a new set of language learning tasks from the area of spoken dialog systems and to a different abstraction-based learner. We then examine the utility of various exceptionality measures for predicting wh...
متن کاملForgetting Exceptions is Harmful
We show that in language learning, contrary to received wisdom, keeping exceptional training instances in memory can be beneecial for generalization accuracy. We investigate this phenomenon empirically on a selection of benchmark natural language processing tasks: grapheme-to-phoneme conversion, part-of-speech tagging, prepositional-phrase attachment, and base noun phrase chunking. In a rst ser...
متن کاملIRDDS: Instance reduction based on Distance-based decision surface
In instance-based learning, a training set is given to a classifier for classifying new instances. In practice, not all information in the training set is useful for classifiers. Therefore, it is convenient to discard irrelevant instances from the training set. This process is known as instance reduction, which is an important task for classifiers since through this process the time for classif...
متن کاملEmotioncy-Based Language Instruction: A Key to Enhancing EFL Learners' Vocabulary Retention
Given the vital role that vocabulary plays in learning a language along with the fact that sense-induced emotions are noteworthy due to their influence on cognition, this study examined the impact of emotioncy-based language instruction on the long-term memory (LTM) vocabulary retention of Iranian EFL learners. To this end, 32 participants were selected based on their level of trait/state anxie...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999